Korean Document Classification Using Extended Vector Space Model

Authors

Abstract

Similar resources

Document Vector Space Representation Model for Automatic Text Classification

Classification of text documents presents a unique challenge to conventional classification algorithms. Because the datasets contain a large number of features, finding a suitable representation for text documents is a problem in its own right. This paper discusses a simple but effective representation model for text documents that addresses the classification problem. Two different...

Text Document Pre-Processing Using the Bayes Formula for Classification Based on the Vector Space Model

This work uses the Bayes formula to vectorize a document according to a probability distribution based on keywords that reflect the categories the document may belong to. The Bayes formula gives a range of probabilities with which the document can be assigned to each of a predetermined set of topics (categories). Using this probability distribution as the vectors to represent t...

Entity-Based Cross-Document Coreferencing Using the Vector Space Model

Cross-document coreference occurs when the same person, place, event, or concept is discussed in more than one text source. Computer recognition of this phenomenon is important because it helps break "the document boundary" by allowing a user to examine information about a particular entity from multiple text sources at the same time. In this paper we describe a cross-document coreference resol...

Document summarisation based on sentence ranking using vector space model

The WWW is a repository of a large collection of information available in the form of unstructured documents. Selecting the documents of interest from such a huge pool is a challenging task. To speed up document retrieval, text summarization techniques are used. Documents are ranked based on the summary or abstract provided by their authors. But it is ...

Document Ranking and the Vector-Space Model

Using several simplifications of the vector-space model for text retrieval queries, the authors seek the optimal balance between processing efficiency and retrieval effectiveness as expressed in relevant document rankings. Efficient and effective text retrieval techniques are critical in managing the increasing amount of textual information available in electronic form. Yet text retrieval is a d...

Journal

Journal title: The KIPS Transactions:PartB

Year: 2011

ISSN: 1598-284X

DOI: 10.3745/kipstb.2011.18b.2.093